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Abstract. An analysis of the IRAS 1.2 Jy redshift cata- 
logue with emphasis on the separate examination of north- 
ern and southern parts (in galactic coordinates) is per- 
formed using a complete set of morphological descriptors 
(Minkowski functionals), nearest neighbour distributions 
and the variance of the galaxy counts. We find large fluc- 
tuations in the clustering properties as seen in a large 
difference between the northern and southern parts of the 
catalogue on scales of 100/i _1 Mpc. These fluctuations re- 
main discernible even on the scale of 200/i~ 1 Mpc. We also 
identify sparse sampling as a major source of "apparent 
homogenization" . Tests on observational selection effects 
concerning luminosity, colour and redshift-space distor- 
tion support the significance of these large-scale fluctua- 
tions. 

Key words: Cosmology: large-scale structure of the Uni- 
verse - theory - observations - Galaxies: clustering 



1. Introduction 

The spatial distribution of luminous matter seen in the 
available galaxy redshift catalogues conveys a complex pic- 
ture of voids, isolated galaxies, clusters and superclusters, 
with the largest filamentary features extending on scales 
comparable with the size of the surveys. Since these find- 
ings are expected to provide important constraints for the- 
oretical models of structure evolution, a prime task is to 
extract the morphological characteristics of the observed 
patterns by appropriate quantitative measures. 

In t he present paper w e consider the IRAS 1.2 Jy cat- 
alogue ( Fisher et al. 1995 ). This infrared-selected survey 
consists of two data sets for the northern and southern 
caps (in galactic coordinates) separated by a 10 degree 
wide zone of avoidance and is therefore nearly all-sky. We 
analyze the two subsamples individually with Minkowski 
functionals (Mecke et al. 1994) in order to quantify the 



global geometrical and topological features of spatial pat- 
terns. In addition, we apply a novel method of cluster 
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diagnostics (van Licshout and Baddeley 1996) combining 
local and global statistical descriptors, and the more con- 
ventional count-in-cell method. 

Practically, when analyzing galaxy surveys one has to 
deal with various technical problems. As a rule, three- 
dimensional galaxy catalogues are constructed from red- 
shift surveys which suffer from distortions due to peculiar 
motions. To date, there is no way of tackling this problem 
without making strong assumptions (e.g. linearized equa- 
tions of motion), and therefore we conduct our analysis in 
redshift space and investigate the differences to real space 
by comparing with results of A-body simulations. Fur- 
thermore, surveys are usually flux limited. Therefore, the 
observed number density of the galaxies decreases with 
distance. For samples with inhomogeneous mean number 
density the methods employing morphological estimators 
are usually dominated by thinning at the fringes. One rem- 
edy is to focus on volume limited subsamples, at the cost 
of losing a substantial fraction of galaxies. Nevertheless, 
we show that Minkowski functionals provide significant 
results even for a small number of galaxies. Finally, all 
surveys are spatially limited both in directions as well as 
in depth, where volume limitation leads to a well-defined 
spherical boundary. Hence, we have to take care of bound- 
ary constraints. In Appendix A we discuss these finite size 
effects on the estimators for the Minkowski functionals, 
for the nearest-neighbour distribution, and for the fluctu- 
ations of the galaxy counts. 

As a result we detect a significant morphological seg- 
regation between the northern and the southern part of 
the IRAS catalogue. These findings are also supported by 
our further results on the variances of counts in cells, done 
separately for the north and the south. 



2. Methods of morphological analysis 

We consider a set X = of N points X; S M 3 given 

by the redshift-space c oordinates of the galaxies in the 
IRAS 1.2 Jy catalogue ( [Fisher et al. 1995| ). 
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Table 1. Minkowski functional in three-dimensional 
space expressed in terms of more familiar geometric quan- 
tities. 



geometric quantity 



M u 



V 
A 
H 

X 



volume 

surface 1 

integral mean curvature 2 

Euler characteristic 3 



V 

A/8 
3x/(4tt) 



V/(^-r 3 N) 
A/{4ivr 2 N) 
H/{A-kvN) 
X/N 



2.1. Minkowski junctionals 

A computationally convenient and robust method for 
quantifying the morphology of spatial patterns is offered 
by Minkowski functionals as they were introduced into 
cosmology by Mecke et al. (1994). For this purpose we 
decorate each point Xj with a ball B r (xi) of radius r 



and then consider the union set Ajy(r) — \J i — Q B r ('Xi) 



Hadwiger (1957) proved that in three dimensions the four 



Minkowski functionals M (Ll= o,i ) 2,3(.Azv(r)) give a complete 
morphological characterization of a body An(t"). The in- 
terpretation of these functionals in terms of geometrical 
and topological quantities is given in Table (|l|). 

By normalizing with the functional M^(B r ) of a 
single ball we can introduce normalized, dimensionless 
Minkowski functionals $ ^(An (r)) , 



M^A N (r)) 
NM^Br) ' 



(1) 



In the case of a Poisson process the exact mean values of 



the functionals can be calculated analytically (Mecke and 



Wagner 1991). For decorating spheres with radius r one 



obtains: 



(l 



(2) 



(1 



32 

3ri 



32 



with the dimensionless parameter r\ = nAlo(B r ) = 
n 47rr 3 /3, where n denotes the mean number density. 

For /i > 1 the measures $ M (.4Ar(r)) contain the expo- 
nentially decreasing factor e^ 71 ^). It can be shown that 
this exponential decrease with increasing (diagnostic) ra- 
dius r also arises for more general cluster processes. There- 
fore we employ the reduction 



M> 1, 



(3) 



and thereby remove the exponential decay and enhance 
the visibility of differences in the displays shown below. 



2.2. Nearest neighbour statistics 

The nearest neighbour distributio n G(r) is a sta ndard tool 



in the analysis of point processes ( Ripley 1986 ) and is de- 
fined as the distribution of distances r of a point of the 
process to the nearest other point of the process. The func- 
tion G(r) is related to conditional correlation functions 
flWhite 1979j ). Another common statistical descriptor is 
the empty space function F(r), the distribution of the dis- 
tances r of an arbitrary point to the nearest point of the 
process, and is equal to the expected fraction of volume 
occupied by points which are less distant than r from the 
next point of the process. Therefore, F(r) is equal to the 
volume density of M {An{t))\ hence 1 — F{r) coincides 
with the void probability function Po(r) (White 1979). Al- 
ternatively, one may interpret F (r) as the unconditional 
and G{r) as the conditional distribution of the nearest 



neighbour distance. Recently, van Licshout and Baddeley 
|(1996) advocated to use the ratio 



J(r) := 



1 - G(r) 
1 - F{r) 



(4) 



as a probe for clustering of a point process. In the case of 
a Poisson process, 



G p (r) = 1 - exp(-7 7 ) = F p (r), 



(5) 



and thus J F {r) = 1. For a process with enhanced clump- 
ing, we have J(r) < 1, whereas regular structures are in- 
dicated by J(r) > 1. 

2.3. Fluctuations of the galaxy counts 

Another frequently used method for exploring galaxy cat- 
alogues is to consider the fluctuations of galaxy counts, 
in particular, the variance of counts in cells in excess of 
Poisson, 



((Ni ~nVf) =nV + n 2 V 2 cr 2 



(6) 



where Ni is the number of galaxies in a volume V, in our 
case a ball B r with radius r; the averaging is performed 
over random positions of the balls. For a Poisson process, 
a 2 = by definition. For stationary point processes the 
variance a 2 is related to the two-point correlation function 
f(r), 



y\)- 



(7) 



After averaging a power-law correlation function £(r) = 
(r /r) 7 over balls B r one obtains (Peebles 1993): 



(l- 7 /3)(l- 7 /4)(l-7/6) 



(8) 



In Appendix B we discuss the dependence of the variance 
c on the mean number density and we point out how this 
has to be modified for fractal sets. 
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3. Results 



We now apply the methods introduced above to explore 
a redshift catalogue of 5313 IRAS selected galaxies with 
limiting flux of 1.2 Jy (Fisher et al. 1995). 



3.1. Volume limited samples with lOOh 1 Mpc depth 

A volume limited sample of 100/i -1 Mpc depth (with h de- 
fined by Hq = 100ft- km s _1 Mpc -1 ) contains 352 galaxies 
in the northern part, and 358 galaxies in the southern 
part. In Fig. |l| we see that the redshift and flux distri- 
butions of the galaxies do not indicate any peculiarity or 
problem with the sample. As far as the number density, i.e. 
the first moment of the galaxy distribution is concerned, 
the sample does not reveal differences between north and 
south. However, we want to assess the clustering proper- 
ties of the data and, above all, tackle the question whether 
the southern and northern parts differ or not. More refined 
measurement of the galaxy distribution nee d to incorpo - 
rate at least the second moment (see Sect. 3.1.5 , 3.2. 2|) . 



A complete characterization of morphology, depending on 
all moments of the distribution, is even more desirable. It 
is provided by the Minkowski functionals. 

3.1.1. Minkowski functionals 

In Fig. H we show the values of the Minkowski function- 
als of the southern and northern parts, together with the 
values from a Poisson process with the same number den- 
sity (the volume density is shown in Fig. |^). The errors 
for the Poisson process were calculated using twenty dif- 
ferent realizations. This already gives a stable estimate of 
the ensemble variance. To estimate the error for the cat- 
alogue data we calculated the Minkowski functionals of 
twenty subsamples containing 90% of the galaxies, ran- 
domly chosen from the volume limited subsamplc. 

In both parts of the 1.2 Jy catalogue the clustering 
of galaxies on scales up to 10/i _1 Mpc is clearly stronger 
than in the case of a Poisson process, as inferred from the 
lower values of the functionals for the surface area, fa , the 
integral mean curvature, fa, and the Euler characteristic, 
</>3. Moreover, the northern and southern parts differ sig- 
nificantly, with the northern part being less clumpy. The 
most conspicuous features are the enhanced surface area 
fa in the southern part on scales from 12 to 20/i~ 1 Mpc and 
the kink in the integral mean curvature fa at 14/i _1 Mpc. 
This behavior indicates that dense substructures in the 
southern part are filled up at this scale (i.e. the balls in 
these substructures overlap without leaving holes), and is 
presumably the signature of the Perseus-Pisces superclus- 
ter (for a more detailed analysis see Kerscher et al. 19"96f ) . 
On scales from 15 to 20/i _1 Mpc, fa, the value of the inte- 
gral mean curvature is negative, indicating concave struc- 
tures. In the southern part the Euler characteristic is still 
negative in this range; therefore, the structure is domi- 
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Fig. 1. In the top panel we show the histograms of the 
redshift distribution of the galaxies for the volume limited 
sample with 100/i _1 Mpc depth. In the bottom panel we 
show the histograms for the flux distribution. The north- 
ern part is shown as a solid line, and the southern part as 
a dashed line. 



nated by interconnected tunnels (giving rise to negative 
contributions to the Euler characteristic) rather than by 
completely enclosed voids (these would result in positive 
contributions to the Euler characteristic). A similar fea- 
ture is seen in the Minkowski functionals of Abell/ACO 
clusters (Kerscher et al. 1997). 



3.1.2. Error estimates 

To get an impression how the error from subsampling is 
related to the intrinsic variance of an ensemble we looked 
at fixed realizations of a Poisson process within the sample 
geometry and calculated the error via subsampling using 
again 90% of the points. This error turns out to be two 
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Fig. 2. Minkowski functionals M of a volume limited 
sample with 100/i _1 Mpc depth; the dark shaded areas rep- 
resent the southern part, the medium shaded the north- 
ern part, and the dotted a Poisson process with the same 
number density. The shaded areas are the la errors as 
explained in the text. 



times smaller than the ensemble error calculated over dif- 
ferent realizations of a Poisson process. 

As for a further error estimate we randomize the red- 
shifts using the quoted redshift errors as the standard 
deviation (and using the mean redshift error, if none is 
quoted). These errors are found to be approximately two 
times smaller than the errors from subsampling as shown 
m Fig. | Even if we increase the quoted redshift error by 
a factor of five the errors in the functionals are of the same 
order as determined from subsampling. 



In using a window as shown in Fig. Al we take care 
of the zone of avoidance. There are additional holes in the 
redshift catalogue due to a lack of sky coverage or con- 
fusion in the point source catalogue. In the northern part 
these holes account for 3.2% of the catalogue, in the south- 
ern part they account for 4.5%. To estimate the influence 
of these regions on the morphological measures we throw 
Poisson distributed points with the same number density 
into these regions. The additional error introduced from 
these random points is smaller than the error from ran- 
domizing the redshifts and therefore, much smaller than 
the errors from subsampling. Moreover, no systematic ef- 
fect is seen, the curves overlap completely. 

3.1.3. Selection effects and the effect of sparse sampling 

Several selection effects might enter into the construction 
of the catalogue. Therefore, we draw subsamples selected 
according to special features like "colours" , higher limiting 
flux, and different boundaries. 

Before listing these tests we address the important is- 
sue of 'sparse sampling'. This issue is especially relevant 
for the interpretation of our tests, since all these selected 
samples incorporate less galaxies than the volume limited 
sample with 100/i _1 Mpc depth. In Fig. || we show how 
sparse sampling affects the surface functionals <I)\{An{t)) 
of the volume limited samples with 100/i _1 Mpc by only 
taking a fraction of the galaxies into account (the other 
functionals behave similarly). By reducing the number of 
galaxies the error increases and the mean value tends to- 
wards the value for a Poisson process. A similar tendency 
is seen in the sparser volume limited sample with depth 
200/i- 1 Mpc (see Fig. fjj). 

In view of these findings the following tests were per- 
formed: 

— We calculated the Minkowski functionals of a volume 
limited sample with 100ft- _1 Mpc depth but now with 
limiting flux equal to 2.0 Jy. Although the noise in- 
creases for larger radii, since fewer galaxies enter, the 
above mentioned features and the difference between 
northern and southern parts are clearly seen. 

— We selected "hot" galaxies, with a flux ratio /100//60 < 
1.5, "warm" ones with 1.5 < /100//6O < 3 and "cold" 
galaxies with /ioo//eo > 3; /100 and f 60 denote the 
flux at 100^ and 60^, respectively. We calculated the 
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Minkowski functionals of volume limited samples with 
100/i _1 Mpc taken from the "hot" (106 in the north 
and 116 in the south) and "warm" (239 in the north 
and 227 in the south) galaxies (only 7 (north) and 15 
(south) galaxies are "cold"). Again the error increases, 
but the distinct features between north and south are 
still discernible. With similar, but more refined criteria 
for distinguishing "warm" and "cool" galaxies, Mann 



et al. (1996) find only a small dependence of clustering 



Fig. 3. The surface functional <j)\ (Ajsr(r)) of a volume lim- 
ited sample with 100/i _1 Mpc depth for randomly drawn 
subsamples with 70% (top panel), 50% (middle panel), 
and 30% (bottom panel) galaxies. Again, dark shaded ar- 
eas represent the southern part, the medium shaded the 
northern part, and the dotted a Poisson process with the 
same number density. 



on the temperature in the QDOT survey. 

— We excluded local structures by looking at galaxies 
which have a distance from the galactic plane exceed- 
ing 20/i _1 Mpc. We also changed the window accord- 
ingly. The distinct features remain, but the significance 
decreases. 

Therefore, the morphometric difference between the 
northern and southern parts may not be attributed to 
luminosity effects, colour effects, or biases due to some 
peculiar local features. 

— By using volume limited subsamples we exclude quite 
a few galaxies in order to get "clean" samples for a 
reliable analysis with geometrical estimators. To check 
this dilution effect we generated a sequence of volume 
limited samples with limiting depths of 40/i _1 Mpc, 
60 ft-^Mpc, 80 h^Mpc, 100 h^Mpc, 150 ft^Mpc, 
200/i" 1 Mpc, and 300/i- 1 Mpc. 

Apart from the 300ft. - 1 Mpc sample, which is too strongly 
diluted to be significant, all these samples show significant 
differences between the northern and southern parts. 

— By cutting the volume limited sample with 100/i~ 1 Mpc 
depth perpendicular to the galactic plane, we obtain 
four smaller parts with approximately 160 galaxies 
each. All four samples show different clustering prop- 
erties. Therefore, the detected fluctuations cannot be 
attributed to a special north-south anisotropy. 



In Kerscher et al. (1996) we show plots of the Minkowski 
functionals of the samples incorporating the discussed se- 
lection effects. 



3.1.4. Comparison with a simulation 

To generate mock catalogues we use the results of a cos- 
mological simulation ( Kolatt et al. 199*6] ) kindly provided 
by Yair Sigad and Avishai Dckel. The simulation was 
performed for an — 1 universe with initial conditions 
given by a constrained realization of a Gaussian random 
field. The constraints were calculated from a density field 
constructed from the IRAS 1.2 Jy galaxy catalogue at 
present time, evolved backwards in time by inversion of 
the 'Zel'dovich approximation'. The simulation forward in 
time was performed using a particle-mesh code with 128 3 
particles, each comparable in mass to an average galaxy, 
in a box with side length 256/i _1 Mpc. The purpose of this 
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simulation is to mimick the current distribution of mat- 
ter and to serve as a testcase for statistical methods. We 
constructed a sequence of twenty mock catalogues both 
in real- and redshift-space from this simulation by taking 
only galaxies which are inside the domain of the sample 



Fig. 4. Minkowski functionals M of a volume limited 
sample with 100/i _1 Mpc depth and of mock samples, 
southern parts only. The light shaded areas represent the 
results in redshift-space, the medium shaded the results in 
real-space, and the dark shaded the results for the IRAS 
data. 



with 100/i _1 Mpc depth (see Fig. Al). By randomly select- 
ing these galaxies we make sure to have the same number 
density as observed in the data. Since we want to com- 
pare the morphological properties of the mock samples 
with the observed galaxy distribution in the northern and 
southern part separately, we choose the same orientation 
of the galactic plane as used for the constrained realization 
of the initial density field. 

In Fig. U we show the average values of the Minkowski 
functionals for the southern part of the mock samples 
both in real- and redshift-space in comparison with the 
Minkowski functionals of the volume limited observed 
sample with 100 h~ 1 Mpc depth. For the simulation the er- 
ror is found by averaging over twenty mock catalogues ex- 
tracted from the same simulation, for the data we plot the 
subsampling error. Both results, from real- and redshift- 
space, are consistent within their errors (the redshift- 
space results show larger scattering) , but they clearly dif- 
fer from the IRAS data. We attribute this to the small size 
of the simulation box (256/i -1 Mpc sidelength) compared 
to the extent of the mock samples (200ft. _1 Mpc diame- 
ter). The simulation enforces homogeneity on the scale of 
the sidelength of the box, suppresses fluctuations on that 
scale, and therefore is not able to reproduce varying clus- 
tering properties in distinct regions which nearly fill half 
of the box each. 

Obviously, we demand too much from the simulation 
which was designed to reproduce the density and velocity 
fields (and especially their relation) in the nearby region 
of the universe out to 60/i~ 1 Mpc. In order to cover fluctu- 
ations on the scale of 200ft.~ 1 Mpc, the simulation simply 
needs to be significantly larger. 

In the case of the northern part the results for the 
IRAS data and the mock catalogues agree (we do not show 
them here) . The real- and redshift-space results now show 
the same scattering. The consistency of real- and redshift- 
space results, both in north and south, gives us confidence 
that our analysis of the observed IRAS 1.2 Jy catalogue 
is not affected by rcdshift space distortions. 

3.1.5. Nearest neighbour statistics and a 2 

In Fig. (^J) we show the empty space function F(r), the 
nearest neighbour distribution G(r) and the J(r) statis- 
tics for the volume limited sample with 100/i _1 Mpc. The 
errors are estimated in the same way as for the Minkowski 
functionals. The error increases with the radius, since F 
and G are approaching unity, and (1 — G)/(l — F) diverges 
due to random fluctuations. 

The volume densities F(r) differ significantly between 
north and south. The lowered volume density in the south- 
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Fig. 5. u for the volume limited samples with 
100ft. _1 Mpc depth; again, the dark shaded areas repre- 
sent the southern part, the medium shaded the northern 
part, and the light shaded a Poisson process with the same 
number density. 



ern part on scales from 10 to 20/i _1 Mpc indicates stronger 
clustering than in the case of a Poisson process, while 
the northern part is marginally consistent with a Pois- 
son process. The nearest neighbour distributions of the 
northern and southern parts practically overlap, but dif- 
fer clearly from the Poisson process. On scales from 10 to 
15ft. _1 Mpc the J statistics discriminates between northern 
part, southern part and Poisson process, again showing 
stronger clustering (reduced J) in the south. 

In Fig. (||) we present the results for the galaxy count 
fluctuations in a volume limited sample with 100/i _1 Mpc 
depth. On the scale of 8/i _1 Mpc both parts display 
comparable fluctuations of counts in cells with erg := 
a(8h~ 1 Mpc) — 0.81 ± 0.06 (la error, southern part) and 
as = 0.73 ±0.07 (northern part). These directly measured 
values both agree with the as = 0.77±0.19 determined by 

and a 



Fisher ct al. (1994) in rcdshift space, using Eq. 
power-law fit to the correlation function. On larger scales 
the difference between north and south is clearly seen, as 
well as their deviation from a Poisson process. 



3.2. Volume limited samples with 200h 1 Mpc depth 

To look deeper into space we use a sample which is volume 
limited to 200/i -1 Mpc depth. The northern part then con- 
tains 139 galaxies, and the southern part 141 galaxies. The 
redshift and flux distribution are similar to those shown 
in Fig. | 






Fig. 6. F, G and J for the volume limited samples with 
100/i _1 Mpc depth; the dark shaded areas represent the 
southern part, the medium shaded the northern part, and 
the light shaded a Poisson process with the same number 
density. 
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3.2.1. Minkowski functional 

In Fig. the values of the Minkowski functionals of the 
southern and northern parts are plotted together with the 
functionals of a Poisson process with the same number 
density. As before, the errors are determined via subsam- 
pling with 90%. Again, the southern part shows a ten- 
dency towards stronger clustering, and the enhanced sur- 
face area is still clearly detectable, but the overall un- 
certai nty h as increased considerably. As pointed out in 
Sect. 3.1.3 the reduced deviations from the Poisson data 



r [h^Mpc] 
20 30 40 



should not be attributed to a homogenization of the un- 
derlying matter distribution on the scale of 200ft~ 1 Mpc, 
but rather are due to sparse sampling with only w 140 
galaxies in each part. 

3.2.2. Nearest neighbour statistics and a 2 

The nearest neighbour distribution G(r) and the J(r) 
statistics for the volume limited sample with 200ft~ 1 Mpc 
depth are shown in Fig. |[ The volume density, the near- 
est neighbour distribution, and the J(r) statistics of the 
northern part are consistent with a Poisson distribution, 
while the southern part differs in G(r) and J(r) both from 
a Poisson process and from the northern part. 

The galaxy count fluctuations (Fig. ^) for a volume 
limited sample with 200ft _1 Mpc depth show clearly a dif- 
ference between northern and southern parts, but only the 
southern part deviates from a Poisson process. 

Figure [Io| presents the average values of a 2 over both 
parts for volume limits of lOOft^Mpc and 200ft~ 1 Mpc 



together with the results of Oliver et al. (1996) and Fisher 
et al. (1994) . For the 200ft~ 1 Mpc sample our av eraged 



values are in good agreement with the results of Oliver 

2 



et al. (1996^ ; since a 1 differ betwee n the northern and 
southern parts, the errors quoted by |01iver et al . (1996 



are underestimated. This may be a result of their method: 
These authors calculate a 2 in shells, and use the mean 
number density in one shell in calculating the variance 
of the count in cells. Hence, they assume homogeneity by 
implicitly weighting with the selection function. 

We also plot <r 2 (r) according to Eq. (0) with the val- 
ues r = 4.27±g;^fe~ 1 Mpc and 7 = 1.68l£|f as given by 
Fisher et al. (1994)| for the volume limited sample with 
lOOft^Mpc depth for both northern and southern parts. 
These values were determined from a fit to the correlation 
function in the range from 1 to 13ft." ^pc. Within the 
quoted errors our results are consistent with the results of 
with a clear tendency towards lower 



(1994) 



Fisher et al 

values of a 2 on larger scales (compared to the 100ft" x Mpc 
sample) . 

The a 2 determined from the volume limited samples 
with 100ft _1 Mpc and 200ft _1 Mpc differ. This variation of 
o~ 2 for the 1.2 Jy data reveals a dependency of a 2 on the 
mean number density n and on the depth of the sam- 
ples (n = 1.84 x 10~ 4 (ft^Mpc)" 3 for the sample with 




Fig. 7. Minkowski functionals </> M (.Ajv(r)) of a volume 
limited sample with 200ft _1 Mpc depth; the dark shaded 
areas represent the southern part, the medium shaded the 
northern part, and the light shaded a Poisson process with 
the same number density. 
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Fig. 9. a 2 for the volume limited samples with 
200/i _1 Mpc depth; again, the dark shaded areas repre- 
sent the southern part, the medium shaded the northern 
part, and the light shaded a Poisson process with the same 
number density. 
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Fig. 10. The mean values for a 2 for the volume lim- 
ited samples with 100/i _1 Mpc and 200fr~ 1 Mpc depth; the 



dark shaded area marks the result of Oliver et al. (1996) 



the medium shaded the mean result for the 100/i _1 Mpc 
sample, and the light shaded the mean result for the 
200/i~ 1 Mpc sample. The dashed line gives the results for 
cr 2 as in Eq. (§) with r = 4.27fe~ 1 Mpc and 7 = 1.68 



according to Fisher et al. (1994) 



Fig. 8. F, G and J of a volume limited sample with 
200/i~ 1 Mpc depth; the dark shaded areas represent the 
southern part, the medium shaded the northern part, and 
the light shaded a Poisson process with the same number 
density. 



100/i _1 Mpc depth and n = 9.05 x 1(T 6 (/i^Mpc)" 3 for 
the sample with 200ft. _1 Mpc depth). In Appendix B we 
show for a point distribution on a fractal, as a simple still 
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tractable model, that a 2 depends on the mean number 
density n and the depth of the sample. 

4. Morphometry of a point set 

In previous sections, we applied Minkowski functionals as 
morphological tools to analyze spatial patterns displayed 
by galaxy catalogue data. Here, we comment on the ra- 
tionale of this method, which became established only re- 
cently in the cosmology literature. 

Basically, a galaxy catalogue is a finite set of triple- 
numbers X = {xi}^ representing a skeleton pattern of 
points imbedded in a domain D of a three-dimensional 
Euclidean space. The geometrical information supplied by 
this skeleton may be organized in a collection of particle 
distributions, 

T n (y u ...,y n \D) , l<n<N, (9) 

with 

r n ( yi) ...,y n |D) := 

E Sfri-i* 1 )...6(y n -x in ), (10) 

(il,— ,in) 

where the sum runs over all n-tupels (ii,... ,i n ) with 
l<ii< . . . <i n <N , and each yj is chosen to lie within D. 

What is the "form" of this point set? Visual inspection 
habitually conveys the impression of distinct structural 
features such as clumps, voids, "Great Walls", "chess- 
board universe" , or even "Fingers of God" . However, these 
subjective impressions may be deceptive. To associate a 
form with a point set, we must put flesh on the skeleton. 
We have done this in a technically controllable way by 
covering the points with copies of a ball B r to create a 
body An(t) = U^= B r (xj), where the common radius r 
may be employed as a variable diagnostic parameter. 

The Minkowski functionals are directly related with 
familiar geometrical and topological quantities (see Table 
|l|) and are ideally suited to measure content, shape, and 
connectivity of bodies such as An(t). In order to explicate 
their connection with the catalogue data, let us write 

N 

A N {r) = (J B r (* t ) = A N -i(r) U Br(x N ). (11) 

One of the basic properties of the functionals M Al (-) is 
additivity which implies, in particular, 

M^A N (r)) = M„(B r (x N )) + M^{A N -x{r)) - 

M^A N ^{r)nB r {x N )). (12) 

By iterating Eq. (|l2|), we ultimately arrive at 
M^(A N (r)) = 

N 

£(-l) n_1 £ M„( Xil ,..., Xin ;r), (13) 



where we have set M (tt (x^ 1 , . . . , x^ n ; r) = M /1 (B r (xi 1 ) n 
• • • fl S r (xj ra )). In terms of the particle distribution r„, 
the expression ( |l3| ) can be written as 

M^Awir)) = El- 1 )™" 1 / dV" / d 3 x n 

M m (xi,. . . ,x n ;r)r n (x 1 , . . . ,x n \D). (14) 

Evidently, the values M^Ajyfr)) of the Minkowski func- 
tionals are determined uniquely by the input data encoded 
in the distributions T„, once a covering is chosen. Equa- 
tion ( |i2|) remains valid when the balls are replaced by 
congruent copies of some other compact convex set; how- 
ever, the values -M^(-) for a given catalogue vary with the 
choice of the coverage. 

We note that the present approach is purely combi- 
natorial; no statistical assumptions enter (beyond those 
involved in the construction of the sample catalogue) , and 
the problem of "fair samples" does not arise. 

Let us compare two samples (1) and (2) with the num- 
ber of galaxies TViSsA^ in a common domain. If we find 
the values of M^(A { ^\(r)) and M^A^r)) to differ over 
some finite range of values of the radius r then the sam- 
ples are certainly distinct morphologically. For instance, 
the V-shaped spike in the normalized Euler character- 
istic $3, seen in Fig. ^, is not a random deviation due 
to some poor statistics but provides definitive evidence 
for higher connectivity (i.e. higher genus) of the southern 
covering body in comparison with the northern one when 
r w 46/i _1 Mpc. 

5. Summary and Conclusions 

We analyzed morphological characteristics of the galaxy 
distribution described by the IRAS 1.2 Jy catalogue. The 
two subsamples (north and south) of this catalogue were 
studied individually by employing three different methods: 
(i) Minkowski functionals, (ii) nearest-neighbour distribu- 
tion, and (iii) variance in the galaxy number counts. 

Since the IRAS data have been obtained from a single 
instrument with uniform calibration, the two subsamples, 
which contain about the same number of galaxies, can be 
compared reliably. 

To assess the significance of our results we took some 
care to account for selection and finite size effects. For 
reference purposes we used typical realizations of a sta- 
tionary Poisson point process. 

Our results may be summarized as follows: 

— The values of the Minkowski functionals for the south- 
ern part differ significantly from those for the northern 
part. This morphological segregation is seen in the vol- 
ume limited subsamples both with 100/i _1 Mpc depth 
and with 200/i~ 1 Mpc depth, but it is less pronounced 
in the latter case because of sparse sampling. The 
Minkowski values of the southern part in particular 
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exhibit drastic deviations from the reference Poisson 
samples. 

— The structural difference between north and south is 
also clearly detectable both in the nearest-neighbour 
distribution and in the variance of galaxy number 
counts. The clustering in the south is recognizably 
stronger than in the north. 

A similar anisotropy in the angular distributions of 
IRAS galaxies around the northern and southern galactic 
poles has already been reported by Rowan-Robinson et 
al. (19 56) and was confirmed by 



Lonsdale et al. (1990) 



However, the majority of previous IRAS catalogue studies 



(e-g- 
1991 


Bouchet et al. 1992, 


Fisher et al. 1994 




Kaiser et al. 




)liver et al. 1996, Protogeros and Weinberg 1997, 


Yess et al. 1997 


) focused attention on the complete sample 



without addressing the distinction of its constituent parts. 

There is no reason to assume a distinguished position 
of the Milky Way galaxy; we therefore conclude that fluc- 
tuations in the global morphological characteristics of the 
IRAS sample extend over length-scales of 200/i~ 1 Mpc, at 
least. These fluctuations may originate from dynamical 
correlations in the matter distribution which arise during 
cosmic evolution. 

Fluctuations occuring on scales up to 200/i~ 1 Mpc, at 
least, imply that cosmological simulations which enforce 
homogeneity on the scale of a few hundreds of /i _1 Mpc 
and suppress fluctuations on larger scales by using pe- 
riodic boundary conditions cannot reproduce the large- 
scale fluctuations indicated by the present analysis of the 
1.2 Jy catalogue. This assertion is confirmed by our com- 
parison with IRAS mock catalogues drawn from simula- 
tions of 256ft. - 1 Mpc box-length. 



Recently Davis (1996) used the flux limited 1.2 Jy 
galaxy catalogue without volume limitation and looked at 
galaxies in different redshift intervals, projected onto the 
sphere. He found strong anisotropics in the number den- 
sity in the nearby region but less in the deeper regions and 
claims to see a cross-over towards a random homogeneous 
distribution on scales well below the sample size. However, 
our results underline that the visual impression of homo- 
geneity on larger scales, especially if based essentially on 
the number density, is indeed deceptive and misleading. 



During the last few years, 


Pictroncro et al. (1996), see 


also [3ylos Labini et al. (1997) 


or Coleman and Pictroncro 



logue data in terms of a fractal support of the galaxy dis- 
tribution. By its nature, a pure fractal indeed exhibits fluc- 
tuations in iV-point distributions on all scales. Our results 
neither support nor contradict this interpretation, since 
the Minkowski functional, as employed in the present pa- 
per, are global measures and are not designed to discrim- 
inate local structures of spatial patterns. 

It will be highly interesting to repeat the present anal- 
ysis for the upcoming PSCz catalogue, and for an optically 
selected survey. Forthcoming work will also focus on the 



analysis of mock catalogues and cosmic variance in large 
(> lft _1 Gpc) simulations. 
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Appendix A: Boundary corrections 

This appendix summarizes boundary corrected estimators 
for the Minkowski functionals M^, the nearest neighbour 
distributions F and G and for the fluctuations of counts in 
cells <7 2 . All estimators are unbiased only for a statistically 
"fair" sample and a stationary point process. However, 
we use identical window geometries for the northern and 
the southern parts, so we may still compare the two parts 
without making these assumptions. Moreover the observed 
discrepancies can be used to falsify at least one of them. 



A.l. Boundary correction for Minkowski functionals 

Integral geometry offers a concise way of dealing with con- 
tribu tions of the sample geometry ( Mecke and Wagnci 
1991 ). In Schmalzing et al. (1996) we discuss a method 



for completely removing boundary effects. 

Let D be the window, i.e. the sample geometry through 
which we look at the galaxies. First of all, we must shrink 
the window D by the radius r, otherwise we would miss 
contributions fro m balls situated outside the sample (as 
shown in Fig. Al). The shrunken window is D r := DUB r . 

We calculate the Minkowski functionals M Al (Z? r ) of the 
window itself, and the Minkowski functionals M M (An (r) H 
D r ) of the intersection of the window with the union of 



all balls (see Fig. (Al)). From these numerically measured 
values, the volume densities m^(A r ) of the Minkowski 
functionals of the union set without the window are ob- 
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-loob'Mpcor ;; 

200h 1 Mpc 



Fig. Al. A cut through the union set of balls .Ajv(r), 
within the shrunken window D r . 



tained by the recursive formula]] 



M^ANjr) nD r ) 
M (D r ) 



E 

v=0 



(Al) 



Equation (Al) is just a linear transformation of the func- 



tionals M /1 (.Ajv(r) n D r ) calculated for o ne realization o f 



the point process. Hadwiger's Theorem ( Hadwiger 1957 ) 



asserts that the m^A/v^)) are a complete set of morpho- 
logical descriptors equivalent to M^An^) n D r ). What 
is more, the mean values of m At (^4jv(r)) are analytically 
known for the Poisson process, and can be directly com- 
pared to the results in Eq. (0). Finally, for a fair sample of 
a stationary point process they are unbiased estimators of 



volume densities of the Minkowski functionals (Fava and 
Santaltfr 19781; [Fava and Santalo 197§), 



A. 2. Boundary correction for nearest neighbour statistics 

Ripley (1988) gives several estimators for F and G appli- 
cable to spatial point processes with boundaries. In our 
work we exclusively use the minus estimators (also called 
reduced sample estimators) F and G. 

The definition uses the distance p(y, X) of a point y 
to the nearest point of the pattern X — {iti}fLi, 

p(y,X) := mindlxi -y|| | x.eX and x; ^ y), (A2) 

and the distance p(y, D) of a point y to the boundary of 
the window D, 



p(y,D) :=min(||z-y|| | z D) 



(A3) 



As explained in Sect. |2.2| , F(r) gives the probability of 
finding a point of the pattern X within a distance r around 

1 We use the convention ~%2~2 ai = to include the case 
a = 0. 



a randomly selected point y. Therefore we estimate the 
proportion of points y £ D with p(y, X) < r. In order to 
obtain an unbiased estimator, y has to keep a distance of 
at least r from the window's boundary, i.e. r < p(y,D). 
Therefore, let {yj}jLi be M randomly distributed points 
yj G D r inside the shrunken window D r . Then F(r) is 
defined by 



F(r) 



#{ yj \p( yj ,X)<r)} 
M 



(A4) 



where . . } gives the number of points in the set {■•■}• 
Note that Eq. (A4) is nothing but a Monte Carlo inte- 
gration of the volume An{t) H D r . This is a direct conse- 
quence of the equality Fo(r) = mo(r) mentioned earlier. 

To estimate G(r ) we also use a reduced sample esti- 
mator ( Ripley 1988 ). Since G(r) gives the probability of 
finding another point of the pattern X within a distance 
r around a point x £ X, we simply replace the random 
sample used in Eq. (A4) by the pattern X. The minus 



estimator G{r) is then defined by 



G(r) = 



#{xj- | p(^,X) <r <p(xj,D)} 
#{xj | r<p(xj,D)} 



(A5) 



A. 3. Boundary correction for galaxy count fluctuations 

For estimating o~ 2 (r) as defined in Eq. (|^), we randomly 
throw two million balls B r , count the number of points 
in each and calculate the variance. In order to take care 
of the boundaries we only choose balls lying completely 
within the window D. This means their centres have to 
fall into the shrunken window D r . 



Appendix B: <j for a fractal, and dependence on 
n 

Figure [l(] reveals differences of a 2 between the 100/i _1 Mpc 
and the 200/i _1 Mpc samples. Although some of the results 
are still consistent given the relatively large errorbars, an- 
other explanation of the differences is possible. In this ap- 
pendix we calculate the dependence of a 2 on the depth Rq 
and the number of points for a sample drawn from a 
fractal distribution of galaxies. 

The generalization of a 2 to point processes on fractal 
sets is not straightforward, since the mean number density 
n is not well-defined. In the definition given in Eq. @ we 
have to replace the number of points in a ball of radius r 



by N{r) instead of nV{B r 



a 2 (r) 



{{N t - N{r)) 2 ) - N(r) 
N 2 (r) 



(Bl) 



For a fractal of correlation dimension D this quantity de- 
pends on the radius through 



> ( f ^ 



(B2) 
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where the constant of normalization Nq/Rq is given for 
a spherical sample of radius Rq containing No galaxies. 
The generalization to cones and an application to number 
counts is discussed in 3ylos Labini et al. (1996). With 
Eq. (B2) plugged into the definition in Eq. (pl|) we obtain 



a 2 (r) 



(Ni) (R 



N 2 



2D 



1 

N 



— ~ — "I 



Ro 

r 



D 



(B3) 



Obviously a 2 depends on N and Ro nontrivially if D < 3. 
For a stationary process with D = 3 the average density 
n := No/Vo — N /(^-Rq) is independent of the sample 
size and we have 



a 2 (r) = (N 2 ) (nf r 3 )~ 2 - {nf r 3 )' 



1 



(B4) 



as a function of the mean number density n. Since (•) 
is defined as a volume average, (N 2 } determined for one 
realization may differ from an average over several real- 
izations, i.e. universes. If one assumes ergodicity of the 
point distribution (implying homogeneity), the volume 
average converges towards the ensemble average. In this 
case we get for example for a Poisson process (N 2 ) = 



(nfr 3 ) 2 + 



and hence cr 2 (r) = 0. 
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